Improved prediction of accessible surface area results in efficient energy function application.
نویسندگان
چکیده
An accurate prediction of real value accessible surface area (ASA) from protein sequence alone has wide application in the field of bioinformatics and computational biology. ASA has been helpful in understanding the 3-dimensional structure and function of a protein, acting as high impact feature in secondary structure prediction, disorder prediction, binding region identification and fold recognition applications. To enhance and support broad applications of ASA, we have made an attempt to improve the prediction accuracy of absolute accessible surface area by developing a new predictor paradigm, namely REGAd(3)p, for real value prediction through classical Exact Regression with Regularization and polynomial kernel of degree 3 which was further optimized using Genetic Algorithm. ASA assisting effective energy function, motivated us to enhance the accuracy of predicted ASA for better energy function application. Our ASA prediction paradigm was trained and tested using a new benchmark dataset, proposed in this work, consisting of 1001 and 298 protein chains, respectively. We achieved maximum Pearson Correlation Coefficient (PCC) of 0.76 and 1.45% improved PCC when compared with existing top performing predictor, SPINE-X, in ASA prediction on independent test set. Furthermore, we modeled the error between actual and predicted ASA in terms of energy and combined this energy linearly with the energy function 3DIGARS which resulted in an effective energy function, namely 3DIGARS2.0, outperforming all the state-of-the-art energy functions. Based on Rosetta and Tasser decoy-sets 3DIGARS2.0 resulted 80.78%, 73.77%, 141.24%, 16.52%, and 32.32% improvement over DFIRE, RWplus, dDFIRE, GOAP and 3DIGARS respectively.
منابع مشابه
A simple and efficient plasticity-fracture constitutive model for confined concrete
A plasticity-fracture constitutive model is presented for prediction of the behavior of confined plain concrete. A three-parameter yield surface is used to define the elastic limit. Volumetric plastic strain is defined as hardening parameter, which together with a nonlinear plastic potential forms a non-associated flow rule. The use of non-associated flow rule improves the prediction of the dil...
متن کاملIntroducing critical residues in the human prion protein and its Asp 178 Asn mutant by molecular dynamics simulation
The molecular dynamics (MD) simulation method is used to assess structural details for humanprion protein (hereafter PrPN) and its Asp178 Asn mutant (hereafter PrPm) which causes fatalfamilial insomnia disease. The results reveal that the flexibility and instability increase in PrPmcould be related to specific amino acids exposed to the solvent. Solvation free energy of PrPm is 20kjmot1nni2 mor...
متن کاملA New Surface Tension Model for Prediction of Interaction Energy between Components and Activity Coefficients in Binary Systems
In this work, we develop a correlative model based on the surface tension data in order to calculate thermodynamic parameters, such as interaction energy between components (Uij), activity coefficients and etc. In the new approach, by using Li et al. (LWW) model, a three-parameter surface tension equation is derived for liquid mixtures. The surface tension data of 54 aqueous and 73 non-aqueous ...
متن کاملDeveloping a dynamic yield and growth model for saffron under different irrigation regimes
Better irrigation management and more efficient management of crop production require modeling of plant growth and crop yield. More applicable models are usually simple and requires less and accessible inputs. The objective of this study was to develop a model for growth and yield prediction of saffron under various irrigation regimes. In this modeling soil water budget and other simple rel...
متن کاملComputationally Efficient Long Horizon Model Predictive Direct Current Control of DFIG Wind Turbines
Model predictive control (MPC) based methods are gaining more and more attention in power converters and electrical drives. Nevertheless, high computational burden of MPC is an obstacle for its application, especially when the prediction horizon increases extends. At the same time, increasing the prediction horizon leads to a superior response. In this paper, a long horizon MPC is proposed to c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of theoretical biology
دوره 380 شماره
صفحات -
تاریخ انتشار 2015